-
Notifications
You must be signed in to change notification settings - Fork 6.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revert 73978 & 74096 due to regressions #75244
Conversation
PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 This reverts commit e9b676a. Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit b18cad1. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
…iour" This reverts commit b10f1ca. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
…oup" This reverts commit 308322e. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
…port" This reverts commit b2243af. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit be086f1. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit b82b5b0. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit d9855da. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 499a633. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 48dff55. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 581a0f5. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 305ec62. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 49ac191. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 93973e2. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 2d72966. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 86b9293. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit 6f62292. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
This reverts commit a9a909c. PR zephyrproject-rtos#73978 introduced a regression. Unfortunately this PR cannot be reverted without reverting also Let's revert both PRs to stabilize main again towards the 3.7 release. For more details on the issue see zephyrproject-rtos#75205 Signed-off-by: Alberto Escolar Piedras <[email protected]>
The revert is needed to unblock development and testing, please resubmit with a proper fix and more testing, we will then reconsider that. If you need to escalate, please bring this to the TSC today.
@aescolar - that's reassuring, although re-submitting might require fixing bugs in other code as well, and that could take time. What is the deadline for resubmitting prior to tagging v3.7.0? Will there be some flexibility? Care to list some change requests?
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 to reverting this late in the game
I would be happy to remove my change request to this PR when there is a concrete answer to these questions. @rlubos has already messaged me and confirmed that the |
Addressed @nashif's remarks in private. My position is that certainly if
Then there is something wrong in testing. Please feel free to suggest a test plan, and as maintainer of the testing area and twister, you should be one of the more qualified individuals to make that recommendation. |
There is no reason to address my remarks in private, I have seen then coming on discord but I will not read them. The place for doing this should be the PR and not private messages. Sorry. |
The many reviewers who approved the PRs approved them without a condition attached. The TSC also approved the PRs for 3.7.0. Due diligence was done by myself as author, tester, all of the reviewers who reviewed these PRs as well as the TSC.
"Not tested well enough"? Sorry, I don't have access to every board supported by Zephyr. However, when I run tests on platforms I have access to, and then build for all platforms with Who knows how many hundreds of thousands of test permutations are executed. Please define "not tested well enough". Maybe you could elaborate on what you feel proper testing is with concrete suggestions? A fix has been provided in #75348 and is significantly less destructive than this PR. It's quite compact. I invited @aescolar to find a test or sample that breaks. So far, nothing.
It makes a difference because you, the Linux Foundation, and others should probably care care about maintainer work-life balance. It really should be a concern - for all of us. We have had community members away for medical reasons recently. A friend in the LF community that does similar work died of a heart attack a few days ago. We have community members from all walks of life, in all circumstances. 🌈 People are different. People can be treated differently. It's OK to do that 👍
I need write the tests and run them on platforms which i have access to, and then build them for all platforms that I can build them for and then submit PRs for review. I'm not maintainer of our CI, testsuite, or tooling. I don't pretend to know every detail about why twister is not working as expected.
It was verified outside over and over by myself and others, it was verified in CI several times spanning weeks. As mentioned yesterday in the release meeting, clearly, tests were getting skipped when they should not have been. What I'm really not kidding; please, actually respond with the You are the maintainer of it so you should have a much better idea than me.
Again, there are only 24 hours in the day / 365 days in the year. Patches welcome! I've been cleaning up technical debt in the posix, C, C++, kernel, net, and other areas what - 4 years I think? If it could have been done overnight, I would have done that. However, it often takes weeks / months to get a single PR reviewed. Please do not be so dismissive. People have different amounts of bandwidth that they can spend working on Zephyr.
You seem to have a lot of insight into how my time should be managed 🤣
Half-baked? 🤣 ... Wow. It's a good thing I know you don't intend to come off as dismissive. |
your position is flawed. You are making this about testing. The fact is, testing DID catch the issue and a bug was reported., that why we have a multi-layered test approach. Whatever it is, the fact is that this PR is causing issues, whether caught by CI or not, the fix provided as commented by you...
so, there is a better change, hence my comment about half-baked fix above. |
The importance of this from @cfriedt perspective is a general concern I have and have voiced in another forum and that is the importance of this particular release. There are consumers of Zephyr that tend to stay on an LTS to the next LTS. Given our current guidelines, this branch is now frozen for the next 2+ years with only bug/security fixes being allowed. Any other release it is easy to say "we will get it in the next release"... not such an easy thing to hear on an LTS. Problems/lessons I see:
|
@nashif - sure after the fact
It seems testing is a little too selective.
It really did. Why would it have passed countless iterations of manual testing, as well as many iterations of CI, even after multiple rebases? So please, since you are the expert, tell me the manual testing to do so that I don't need to do waste twice as much of my time doing this again.
I can do that change too, I mean there is a clear path forward, I just want acceptance criteria to be agreed on in advance. |
Just to clarify, the TSC approved for the PRs to be included in 3.7 granted they are approved as per the usual review process -- it didn't give a blanket approval to the PRs themeselves. The same goes for all the PRs/exceptions voted upon during the June 26 TSC meeting. |
This seems kinda... energetic. Just to jump in with an opinion where we already have too many: what's the harm of waiting a few days to stabilize and merge the fix? Others are pointing out this stuff too, but just to summarize:
I mean, stuff breaks, people mess up. It happens, we've all been there. But I don't see much of an argument for ejecting the feature? |
I already said above, the reverted PR can be resubmitted again with a proper fix that addresses all issues and it will be re-considered. The exception granted by the TSC still valid, but this needs to be done within the next few days. |
yes, if you have a better way of doing this, please submit a proposal. Our CI has been operating this way for years.
We can't build every sample/test variant on each of the 800 platforms we have, this does not scale and each PR will take days to complete, that is where developers are tasked with picking the right coverage and defining where tests need to run. In this particular case the build issue can be reproduced on qemu_x86 by adding
see above...
because you did not modify the tests for the new feature and additions and change in header hierachy? Of course CI will pass if there is no test coverage for something that was just introduced.
If you do not know how to test your own features, no expert (and I am not claiming to be one) will be able to help. But I want to be productive here, I already spent a few minutes to find that enabling CONFIG_MBEDTLS or any others "users" might expose such issues in the future and would be good to have as a scenario. The socket issue I do not know, someone else need to provide their input. |
I only respond here because there is clearly some quite big misunderstandings driving some of the comments, and I hope that by clarifying them we can all move on and continue working on improving things.
Fetching the PR branch and running for ex.: Please note on top of what Anas said:
This revert PR was submitted: 4 days after the first bug report on this issue, 2 days after @cfriedt queued a draft PR. That draft PR was failing in several ways in CI and from my review it did not seem like something we could merge for 3.7. Several more bug reports had been raised. And there had been multiple reports of PRs and developers blocked by these issues.
That 2nd hotfix was rejected in its own PR as it did not seem appropriate. Other issues which the original PR had introduced and which were unrelated to the hotfix had also been raised at that point.
Please note the following (this is not to blame, but to provide pointers which hopefully will help us all):
Moreover as @kartben and @nashif already mentioned, all the TSC did is approve an exception for the rule which prevents PRs with new features being merged after rc1. The TSC did not vote on the content or merit of the PRs. Moreover the TSC chair warned about the risk of merging PRs like this so late, a risk which materialized. The approval from the TSC was only thru a lack of objections. I don't recall any TSC member expressing their interest in this PRs being merged.
In that case those companies should have allocated resources to ensure this changes made it well in time for the release. The rc1 deadline has always been known. Please note all subsystems have features they would like to get in. And we'd all like as many features as possible in.
@cfriedt At the very least the bugs that were reported and linked against the original PR must be fixed. Please also try to improve the test coverage.
@andyross Several other PRs and developers were blocked before merging this revert. We are just 1.5 weeks from final freeze. There was more than one distint regression introduced by this PR. There was no reasonable fix in sight even after several days after the issues were reported. Please, let's just move on and continue fixing bugs. This whole issue has taken way too much time from too many people. |
@aescolar - there fully was a reasonable fix in sight (@rlubos even had a PR ready to go for the network config). You need to be willing to see the solution without dismissing it because it came from an engineer you dislike 🤷♂️ |
PR #73978 introduced a regression.
Unfortunately this PR cannot be reverted without reverting also
#74096.
Let's revert both PRs to stabilize main again towards the 3.7 release.
For more details on the issue see
#75205
Note during the revert cea6bf5 conflicted, but it was merged taking the change in cea6bf5
Fixes #75205
Fixes #75364
Note the checkpatch error "Error: lib/posix/options/fs.c:89:WARNING: Violation to rule 21.2 (Should not used a reserved identifier) - rename" is the exact same as in the orinignal PR, as the diff contains the same lines just reverted.